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Abstract — Network calculus is a min-plus system theory for 
performance evaluation of queuing networks. Its elegance stems 
from intuitive convolution formulas for concatenation of de- 
terministic servers. Recent research dispenses with the worst- 
case assumptions of network calculus to develop a probabilistic 
equivalent that benefits from statistical multiplexing. Significant 
achievements have been made, owing for example to the theory 
of effective bandwidths, however, the outstanding scalability set 
up by concatenation of deterministic servers has not been shown. 

This paper establishes a concise, probabilistic network calculus 
with moment generating functions. The presented work features 
closed-form, end-to-end, probabilistic performance bounds that 
achieve the objective of scaling linearly in the number of 
servers in series. The consistent application of moment generating 
functions put forth in this paper utilizes independence beyond the 
scope of current statistical multiplexing of flows. A relevant addi- 
tional gain is demonstrated for tandem servers with independent 
cross-traffic. 

I. Introduction 

Network calculus [1], [2] is a theory of deterministic 
queuing systems that is based on the early (cr, j o)-calculus for 
network delay [3] and on the work on generalized processor 
sharing in [4]. Founded on min-plus algebra [5] it relates to 
classical system theory, where output and concatenation of 
systems can be derived by intuitive convolution formulas [6], 
[7], [8], [9], [10]. Owing to these network calculus allows 
for an efficient analysis of networks of queues. It facilitates 
the derivation of worst-case backlogs and delays by applying 
deterministic upper envelopes on traffic arrivals and lower 
envelopes on the offered service, so-called arrival and service 
curves [6], [4]. 

Although the fundamental performance bounds provided 
by deterministic network calculus were proven to be tight, 
that is they are attained for certain sample paths [2], the 
occurrence of such worst-case events is usually rare, especially 
if aggregated traffic is considered. Statistical multiplexing of 
independent flows is known to smooth out burstiness with a 
high probability, whereas bursts are cumulated by determin- 
istic network calculus, resulting in pessimistic bounds and 
overestimation of resource requirements. This has motivated 
considerable research in recent time aiming at a probabilistic 
equivalent which efficiently utilizes statistical multiplexing 
while preserving powerful concatenation properties. 

This work was supported by the DFG under an Emmy Noether grant, by the 
Institute Mittag-Leffler under the program on Queueing and Teletraffic Theory, 
by the NSF under grant CNS-0435061, and by two NSERC Discovery grants. 



Introducing independence of flows to the framework of net- 
work calculus tightens assumptions, such that traffic sources 
must not be adversarial jointly, although independent incre- 
ments are not assumed and sources may still create worst- 
case traffic patterns on their own. Adversarial sources and 
the worst-case patterns generated by extremal shape controlled 
traffic were subject to detailed investigations [11]. The perfor- 
mance of multiplexing leaky bucket and peak rate constrained 
on-off sources was investigated in [12] and further elaborated 
in [13], where buffer and server were decoupled. General 
models for regulated traffic were investigated in [14]. A cor- 
responding framework for bufferless multiplexing was derived 
in [15] and for buffered multiplexing in [16]. Improvements 
and a generalization of the bounds in [16], [17] were provided 
in [18]. 

A more generic class of traffic arrivals was investigated 
in [19] where an exponential bound on the burstiness was 
applied. The approach was extended to the concept of stochas- 
tically bounded burstiness in [20] and generalized in [21]. 
Related models were also used in [22], [23], [1], [24], where 
the pioneering work in [23], [1] carried the deterministic 
(cr, p) -calculus from [3] forward to a stochastic setting using 
(c(#), p(f?))-envelopes of the moment generating function 
(MGF) of arrival processes. Backlog, delay, and output bounds 
for work-conserving links were derived from Chernoff's the- 
orem in [23], [1]. A related theory for intree networks was 
developed in [25] using stochastic ordering. 

Generally, probabilistic backlog, delay, and output bounds 
for a single server facilitate the iterative application to net- 
works of queues, using respective output bounds as input 
bounds for subsequent downstream servers and adding per- 
server delay and backlog bounds up. Unfortunately, this ap- 
proach results in end-to-end performance bounds which are 
loose and decay rapidly with the number of servers that are 
traversed [26]. Few models are, however, known that allow 
concatenating probabilistic service curves to derive end-to-end 
probabilistic network models. 

An important continuation of the work in [23] was put for- 
ward in [1] applying the more general f(9, i)-upper constraint 
for traffic sequences and a corresponding lower constraint for 
the service offered by servers with time varying capacity. 
The proposed framework provides calculation formulas for 
stochastic backlog and delay bounds as well as the funda- 
mental concatenation property. Related results for max-plus 



systems [5] were derived in [27] and applied to tandem queues 
using recursive equations to describe departure times which 
were then solved in matrix formulation. 

A significant, alternative step towards a probabilistic, end- 
to-end network calculus was presented in [28], where the 
concept of effective envelopes, which are violated at most 
with a defined probability, was derived using MGFs and the 
Chernoff bound. Related theories use the concept of statistical 
envelopes [29] and apply second order statistics [30]. The 
concept of effective envelopes was further developed in [31], 
[32] and [33] where the relation to the theory of effective 
bandwidths, for example [1], [34], was shown. MGFs are used 
extensively by the theory of effective bandwidths from where a 
variety of useful traffic models are known, see for example [1], 
[34], [28], [23], [16], [33] and references therein. 

Effective envelopes are conceptually close to deterministic 
envelopes as applied by network calculus, resulting in an 
intuitive continuation including the important concatenation 
theorem. However, the derivation of violation probabilities 
for performance bounds as well as for concatenated effective 
service curves has proven to be hard [33] and requires ad- 
ditional assumptions, for example bounds on the duration of 
busy periods [33] or explicit dropping of traffic that violates 
a certain delay bound [35]. Once effective envelopes are fixed 
no further statistical multiplexing gain is accessible and worst- 
case assumptions apply, such that independence is not required 
any longer. 

In recent work [26] the Landau notation O was introduced 
to the field of network calculus to analyze the scaling of end- 
to-end performance bounds in the number of servers in series 
n. This important aspect provides comparability of different 
models and improves their understanding. The standards set 
by deterministic network calculus are a scaling in (D(n 2 ) for 
bounds that are derived iteratively compared to 0(n) obtained 
from end-to-end convolved service curves [2]. In [26] an 
outstanding scaling of statistical bounds in O(nlogn) was 
achieved without assuming statistical independence, where rate 
correction terms were introduced which dispense with busy 
period bounds. 

Until now it has, however, not been proven that end-to-end 
probabilistic performance bounds can actually scale as well 
as deterministic bounds. Based on the pioneering stochastic 
{c{9) 1 p(#))-calculus [23] and following the suggestion in [1] 
we develop an intuitive, system-theoretic formulation of a 
probabilistic network calculus with MGFs which achieves the 
target scaling in 0(n) without requiring assumptions beyond 
existence of respective MGFs and statistical independence. 

The paper is organized as following: We derive correspond- 
ing MGFs of relevant min-plus operations in Sect. |ll| In 
Sect. [Ill] we develop a probabilistic network calculus based on 
the results from Sect, ^fl The scaling properties of performance 
bounds are proven in Sect. II VI and numerical results are shown 
in Sect.|V] Throughout the paper we use a discrete time model 
t 6 No = {0,1,2,...}. 



II. Moment Generating Functions 

The particular strength of network calculus is the efficient 
analysis of servers in series. Tandem servers can be concate- 
nated using min-plus convolution and performance bounds 
can be derived based on min-plus de-convolution. Min-plus 
convolution of real- valued, bivariate functions x(s,t) and 
y(s,t) [1] and a corresponding min-plus de-convolution are 
defined for t > s > as 

(x®y)(s,t) = inf [x(s, t) + y(r, t)] 

T£[s,t] 

(1) 

(x0y)(s,t) = sup [x{T,t) - y{T,s)]. 

t£[0,s] 

Note that the defined operators <g> and are not commutative. 

The applicability of network calculus suffers from the worst- 
case modelling that is used. The deterministic view totally 
neglects statistical multiplexing which, however, potentiates 
significant resource savings. Related aspects are well under- 
stood for single servers and the theory of effective bandwidths 
provides a general traffic model which makes the statistical 
multiplexing gain among independent, multiplexed flows ac- 
cessible. Effective bandwidths resort to MGFs which for a 
random variable X are defined for any 9 as 

M x {9) = Ee ex 

where E is the expectation of its argument. In the sequel we 
use the notation M x (9) = M x (-9) = Ee~ ex . 

MGFs exhibit a number of useful properties: Given two con- 
stants ci and c 2 it is known that M Cl+C2X (9) = e 6ci M x (c29). 
Further on, for two random variables X and Y and for 9 > 
it can be easily verified that 

M min[ x,Y](0) < min[Mx(0), M Y (fi)] 
M max[x , y] (0) < mm[M x (9),M Y (9)} 

since for 9 > it holds that M min [ X ,Y] (9) < M x (9) as well as 
Mmin[x,Y] (9) < M Y (9). For the sum of independent random 
variables X and Y it is well-known for any 9 that 

M x+Y (9) = M X (9)M Y (9) 

M X ^ Y (9) = M X (9)M Y (9). 

Lastly Chernoff's theorem builds on MGFs to estimate the 
violation probability of an effective bound x for a random 
variable X. For any x and all 9 > it is known that 

P{X >x}< e- ex Ee ex . 

Throughout the remainder of this section we proof an 
important connection between min-plus algebra and conven- 
tional algebra established by MGFs [27], [1]. We will show 
that MGFs transform min-plus convolution and de-convolution 
into corresponding operators * and o in conventional algebra 
which for real-valued, bivariate functions x(s,t) and y(s,t) 
are defined for t > s > as 

t 

(x * y){s,t) =^Tx(s,T)y(T,t) 

T w 

(xoy)(s,t) = ^2x(T,t)y(r,s). 

T = 



We will sometimes use the operators and o for s > t > 
whereupon the supremum respectively the sum is evaluated 
for all r £ [0, t] which for notational convenience is omitted 
in (|TJ and @. 

The following lemmas act on a suggestion made in [1] and 
are in accordance with a similar investigation of max-plus 
systems in [27]. 

Lemma 1 (Moment Generating Functions of (g> and 0) 

Let X(s, t) and Y(s, t) be independent random processes. 
The MGF of the min-plus convolution respectively the MGF 
of the min-plus de-convolution of X(s,t) and Y(s,t) are 
upper bounded according to 

M ( xev)(0,«,i) < (Mx(0)*My(0))(«,t) 
M (X0Y )(O,s,t) < (M X (0) o My (*))(*, t). 

Proof: The MGF of the min-plus convolution of X(s, t) 
and Y(s, t) is 

An upper bound on the MGF follows for any 8 as 

Mr—Cfl « ^ <F r P - e (^(^)+ y (r,t))l 



i(x<gy)(0,s,i) < E sup [e 

re[»,t] 



< 



-e(X(s,T)+Y(r,t)) 



EE' 

r— s 
t 

= V* Ee -e(x( a ,T)+y(T,t))_ 

T — S 

Applying (|3} and l|4} completes the proof. The proof for min- 
plus de-convolution is an immediate variation. ■ 
Unlike [28], [33], where MGFs and the Chernoff bound are 
used to derive effective envelopes before min-plus convolution 
and de-convolution are utilized, we use only MGFs and apply 
the Chernoff bound in a final step to derive bounds on 
the results of min-plus convolution and de-convolution. This 
approach maintains random processes as long as possible, 
which in case of statistical independence is beneficial for the 
quality of performance bounds. 

Lemma 2 (Envelopes for and 0) Let X(s,t) andY(s,t) 
be independent random processes. Effective lower envelopes 
on the min-plus convolution respectively upper envelopes on 
the min-plus de-convolution of X(s,t) and Y(s,t) that hold 
at least with probability 1 — e where e € (0,1] are given by 



> 1 -£, 



p<^ (i«y)( s ,t) > 



sup 

0e(o,oo) 



-(m £ -ln(M x (0)*My(0))(M)) 



P{{X(Z)Y){s,t) < 
T 



inf 

ee(o,oo) 



(ln(M x (0) o My(0))(M) -lnt 



> 1 



Proof: Boole's inequality yields 

P{(X <E>Y)(s,t) < z(s,t)} 

= P{ inf [X(s,r)+Y(T,t)}<z(s,t)} 
t 

= P[\J{X( S ,T) + Y(r,t) <z(s,t)}} 

T — S 

t 

<]Tp{x( s ,T) + y(T,i)<z( s ,i)}. 

T — S 

Applying Chernoff's bound it follows for all 8 > that 

P{(I«F)(s,() < z{s,t)} 
t 

< Y, P{~X(s, t) - Y(t, t) > -z(s, t)} 

T — S 
t 

< y^ e 9z{a,t) Ee -0{X{a,T)+Y{T,t)) _ 



Setting the right hand side equal to e we can solve for z(s, t) 
and obtain 

z(M ) = Ubxe-u(j2 Ee-^ X ^ +Y(T ^)) 



for all 8 G (0, oo) where we choose the optimal 8. The proof 
for min-plus de-convolution is an immediate variation. ■ 
For completeness we note that related formulas as pro- 
vided by Lemma ^ an d Lemma |2] can be derived with- 
out assuming independence in which case these operations, 
however, become costly. From Holder's inequality E|AB| < 
(E\A\ K ) 1 / K {E\B\ U ) 1 /" where k, v > 1 and 1/k + l/v = 1 it 
follows for the sum of two random variables X and Y with 
A = e ex and B = e ±eY for all 8 that 

M x+ y(0) <(M x (^)) 1 / K (My(^)) 1 ^ 
M*_y(0) <{U x {k,8)) 1 I k {My{v8)) 1/v 

which replaces (|3j whenever independence cannot be assumed. 
Related theories which do not require independence are shown 
in [19], [20] and are significantly further advanced for effective 
envelopes [28], [31], [26], [33], [32], whereas this work 
focuses on exploiting statistical independence. 

The dual operators * and o form the basis for an intuitive 
probabilistic network calculus with MGFs which features 
the efficient utilization of statistical multiplexing within a 
framework for effective concatenation of tandem servers. Note 
that Lemma Q can be applied iteratively to more than one 
min-plus convolution or de-convolution, for example an upper 
bound on the MGF of (A" (Yi • • • Y n ))(s, t) is given 
by (M x (8) o (M Yl (9) * • • • *M Y „(9)))(s,t). This property is 
an essential prerequisite for performance analysis of tandem 
servers as shown in the following sections. 



III. Probabilistic Network Calculus 

In this section we recall fundamental properties of network 
calculus, see for example [1], and apply the results from 
Sect. lU to derive probabilistic performance bounds. 

Arrival and departure processes are described by real- 
valued, cumulative functions A(Q,t) and D(0,t) respectively 
which represent the amount of data seen in the interval (0, t]. 
We assume that there are no arrivals in the interval (— oo,0]. 
Clearly A(0, t) and D(0, t) are nonnegative and increasing in 
t. The amount of data seen in an interval (s,t] is denoted by 
A(s, t) = A(0, t) - A(Q, s) and D(s, t) = D(0, t) - D(0, s), 
where A(s,t) and D(s,t) are nonnegative, increasing in t, 
decreasing in s, and A(t, t) = and D(t, t) = for all t. 

Definition 1 (Dynamic Server) Assume A(0,t) and D(0,t) 

are the arrival respectively departure process of a lossless 
server. Let S(s, t) for t > s > be a random process that is 
nonnegative and increasing in t. The server is called a dynamic 
server S(s, t) if for any fixed sample path it holds for all t > 
that 

D (0,t) > (A S)(0,t). 

The definition of dynamic server is according to [1], [36], [37]. 
It extends the framework of deterministic network calculus 
using bivariate functions. Note that while A(s,t) is defined 
as A(s, t) = A(0, t) — A(0, s) similar assumptions are not 
generally made for S(s,t). 

Example 1 (Work-Conserving Server) Consider a work- 
conserving server with arrival and departure process A(0,t) 
and D(Q,t) respectively. For any t > fix t = sup[s 6 
[0,t] : D(0, s) = A(0,s)], that is t is the beginning of the 
last backlogged period before t if any or otherwise t — t. Let 
S(t, t) be a random process that denotes the service offered 
by the server in the interval (r, t] under the condition that r is 
the beginning of the last backlogged period before t. Assume 
that S(r,t) is nonnegative, increasing in t and S(t,t) = 0. 
From the work-conserving property we have for any fixed 
sample path that D(0, t) = D(0, t) + S(t, t), that is the server 
is non-idling and uses the entire available service S(t, t) to 
serve backlogged data. With D(0, r) = A(Q, r) it follows that 
D(0, t) = A(0, t) + S(t, t) whereby Def. □ is fulfilled. 

Performance bounds for dynamic servers have been obtained 
in [1], [36], [37] where it has also been shown that dynamic 
servers in series can be effectively transformed into an equiva- 
lent, single dynamic server. Thus, known performance bounds 
extend immediately to tandem servers. For completeness Th.^ 
and Th.|2]rephrase these results formally and show correspond- 
ing MGFs respectively probabilistic bounds. In the sequel 
upper indices indicate servers respectively belonging arrival 
and departure processes of a server. 

Theorem 1 (Concatenation) Consider two dynamic servers 
S 1 (s,t) and S 2 (s,t) in series. There exists an equivalent, 
single dynamic server S(s, t) for t > s > where 

S( s ,t) = (S 1 <g> S 2 )(s,t). 



Assume the service offered by each of the servers has MGF 
M51 {9, s, t) respectively M 52 (0, s, t). Under the assumption 
of statistical independence of S 1 {s,t) and S 2 {s,t) the MGF 
of the equivalent, single dynamic server is upper bounded for 
t , > s > according to 

M s (0,s,t) < (M sl (0)*M s2 (0))(s,t). 

Proof: Let A l (0,t) and £>*((),£) be the arrival re- 
spectively departure process of server i, where A l (0,t) = 
Z? l_1 (0,t). For any fixed sample path it follows from Def. Q 
for all t > that 

3t G [0, t] : D 2 (0, t) - A 2 (0, t) > S 2 (t, t). 
In the same way it holds for any r > that 

3s e [0,t] : D 1 (0,t)-A l (0,s) > S 1 {s,t). 
Taking the sum yields for all i > that 

3rG[0,t],3s6[Q,T] : D 2 (0,t)-A 1 (0, s) > S 1 (s,T)+S 2 (T,t) 

where the right hand side is the min-plus convolution of S 1 
and S 2 which proves the first part. An upper bound on the 
MGF of min-plus convolution of two statistically independent 
random processes (S 1 <g> S 2 )(s,t) follows from Lemma as 

(M sl (0)*M s2 (0))(M). ■ 
Two performance measures, namely backlog and delay as 
defined below, are of particular interest in the context of 
networking and networked applications. 

Definition 2 (Backlog and Delay) Let A(0,t) and D(0,t) 

be the arrival respectively departure process of a lossless 
server. The backlog at time t > is 

b(t) = A(0,t) -D(0,t). 

Assuming first-in first-out ordering the delay at time t > 
is defined as 

d(t) = inf [s > : A(0, t) - D(0, t + s) < 0]. 

Theorem 2 (Backlog, Delay, and Output Bounds) 

Consider a dynamic server S(s,t) with arrival process 
A(s,t). The backlog at time t > is upper bounded 
according to 

b(t) <(A(Z>S)(t,t). 

Assuming first-in first-out scheduling the delay at time t > 
is upper bounded according to 

d(t) < inf[s > : (A S) (t + s, t) < 0]. 

Note that the delay bound formulation using min-plus de- 
convolution extends the domain of the definition in Q which 
is accounted for by taking the supremum over the adapted 
interval [0, t], that is (A S)(t + s, t) — sup TG [ t i [A(t, t) — 

S(T,t + S )]. 

The departure process D(s,t) is upper bounded for any 
t > s > according to 

D{s,t) < (A®S)(s,t). 



Assume the arrival process has MGF Ma(0, s,t) and the 
service offered by the server has MGF Ms(0, s, t). Under the 
assumption of statistical independence of A(s,t) and S(s,t) 
an upper bound on (A S)(s,t) with violation probability 
e £ (0, 1] is given by 

p((A0S)(«,t) < 



inf 

9€(0,oo) 



-(ln(M A (9)oM s (e))(s,t)-ljxe) 



> 1 -e 



which provides an upper bound on (A0S)(s, t) that is violated 
at most with probability e and in turn establishes probabilistic 
backlog, delay, and output bounds. 

Proof: Let A(0, t) and D(0, t) be the arrival respectively 
departure process of the server. It follows from Def. [2 for any 
fixed sample path and all t > that 

3t e [0, t] : D(0, t) > A(0, r) + S(r, t) 
<^3r e [0, t] : A(G, t) ~ D(0, t) < A(0, t) - A(0, r) - S(t, t) 

where A(0,t) — D(0,t) = b(t) is the backlog at time t and 
A(0,i) — A(Q,t) — A(r,t), Reformulation using min-plus 
de-convolution completes the proof of the backlog bound. 

It follows from the definition of delay and Def. ^ f° r an Y 
fixed sample path and all i > that 

d[t) = inf [s > : A(0, t) - D(0, t + s) < 0] 
=^d(t) < inf [s > : A(0, t) - A(0, r) - S(t, t + s) < 0, 

Vr e [0,t + s]}. 

Since A(0,t) - A(0,r) < for r > t and generally 
S(r,t + s) > for all t E [0,t + s] it is sufficient to just 
consider all t 6 [0,t] where A(0,t) - A(0,r) = A(r,t). 
Reformulation using min-plus de-convolution over the interval 
[0, i] completes the proof of the delay bound. 

Lastly, it follows from Def^for an y fixed sample path and 
alU > s > that 

3t e [0, s]:D(0, s) > A(0, r) + 5(r, s) 

e[0,s]:D(0,t)-D(0, s) < D(0, t) - A(0, r) - 5(r, s) 
=Hr e[0,s] :£>((), t) -D(0,s) < A(0, t) - A(0, r) - S'^s) 

since A(0, t) > D(0, f) for causality. With D(0, t)-D(0, s) = 
D(s,t), A(0,t) - A(0,t) = A(r, t) and using min-plus de- 
convolution the proof of the output bound is completed. 

The respective probabilistic upper bound on the result 
of min-plus de-convolution of two statistically independent 
random processes (A S)(s,t) follows from Lemma|2] ■ 

After showing the basic network calculus theorems for 
dynamic servers the remainder of this section addresses sched- 
ulers which implement respective properties. Def.[0is trivially 
fulfilled by deterministic servers [1]. Beyond this obvious 
class an example has been given in [36] for work-conserving 
servers with time-varying capacity and in [1], [36], [37] for 
the dynamic service curve based earliest deadline scheduler. 
The corresponding MGF of the offered service is, however, not 



derived in [1] and probabilistic performance bounds obtained 
from MGFs are provided in [23], [1] for the class of work- 
conserving constant rate servers only. 

In the following we derive the leftover service under a 
general scheduling model which does not make assumptions 
about the order in which flows are served with respect to 
each other. Thus, it is conservative for most schedulers and 
it includes widely used aggregate first-in first-out scheduling 
as well as priority scheduling. The approach that is applied by 
deterministic network calculus is based on per-flow service 
curves which are derived by subtracting service that is con- 
sumed by cross-traffic [1], [38], [2]. When applying effective 
envelopes to model cross-traffic, probabilistic per-flow service 
curves arise naturally from deterministic servers [33], [32]. 
Similar results are obtained here for dynamic servers, however, 
using MGFs. The following derivations are made for two 
flows or aggregates of flows where lower indices denote flows 
respectively the service which is provided to a particular flow. 
Note that any of the two flows can actually consist of an 
arbitrary number of micro-flows. 

Proposition 1 (General Scheduling Model) Consider two 
flows with arrival processes Ai(s,£) and A2(s,t) that are 
scheduled at a work-conserving server with service process 
S(s, t) as defined in Ex. Assume that S(s, t) is nonnegative, 
increasing in t, and S(s, s) = 0. Flow two sees a dynamic 
server 

S 2 (s,t) = max[Q,S(s,t) - A x (s,i)]. 

Assume A\(s,t) and S(s,t) are statistically independent and 
have MGF Ma ± (0, s, t) and Ms(#, s, t) respectively. The MGF 
of the dynamic server seen by flow two is upper bounded for 
> and t> s > by 

Ms 2 (0,s,t) < mm[l,M s (0,s,t)M Al (6,s,t)}. 

Proof: Let Aj(0, t) and -Di(0, t) be the arrival respec- 
tively departure process of flow i £ [1,2]. For any fixed sample 
path and any t > there exists s — sup[r G [0, t) : -D;(0, r) = 
A,*(Q,r),Vi <E [1,2]], that is s is the beginning of the last 
backlogged period before t if any or s — t otherwise. For a 
work-conserving server and the defined s it holds for any fixed 
sample path that 

Di (0, t) + D 2 {0, t) = Di (0, s) + D 2 (0, s) + S(s, t) 
= A 1 (0,s) + A 2 (0,s) + S{s,t). 

With £>i(0, t) < Ai(0,i) for causality we obtain 

D 2 (0, t) > A 2 (0, s) + S(s, t) - (A x (0, t) - Ax(0, a)). 

Since D 2 (0,t) > D 2 (0,s) = A 2 (0,s) it follows that 

D 2 (Q, t) > A 2 (0, s) +max[0, S(s, t) - (Ai(0, t) - Ai(0, s))\. 

With Ai(0, t) - Ai(0, s) = Ai(s, t) the first part of the proof 
is completed. The MGF for statistically independent random 
processes A%(s, t) and S(s,t) follows from (|2j and ■ 



IV. Scaling of End-to-End Performance Bounds 

The scaling of end-to-end backlog and delay bounds in the 
number of servers in series, denoted by n, is a crucial aspect 
in network performance analysis. The benchmark set by deter- 
ministic network calculus for leaky bucket sources is a scaling 
in 0(n 2 ) for summation of per-server bounds and 0(n) for 
bounds obtained from end-to-end network service curves [26], 
[2]. This result highlights the importance of the concatenation 
theorem. Recently, probabilistic performance bounds that scale 
in O(nlogn) were obtained in [26] using effective network 
service curves and the exponentially bounded burstiness traffic 
model from [19]. We show that actually a scaling in 0(n) can 
be achieved under the assumption of independence. Like [26] 
we consider the scenario shown in Fig. [2 where performance 
bounds are derived for the through flows that traverse all n 
servers in series. Cross-traffic joins and leaves at each server 
where we make the assumption of statistical independence. 



Fig. 1. Tandem servers with cross-traffic 

We apply the (<r(9), p(f?))-traffic model from [1] to derive 
closed-form solutions for end-to-end performance bounds. An 
arrival process A(s,t) with MGF M A (9, s, t) is (a(9),p(9))- 
upper constrained for some 9 > if for all t > s > it holds 
that 

^M A (6,s,t)<a(0)+p(6)(t-s). (5) 

A related quantity l/(9t) In Ma(9, 0, t) is known as effective 
bandwidth, for example [1], [34]. A wide variety of traffic 
models are known from related theories, including periodic 
sources, fluid sources, on-off sources, and regulated sources, 
see for example [1], [34], [28], [23], [16], [33] and references 
therein, which can be described using the (a(9), p((9))-traffic 
characterization in 0. 

Note that the right hand side of (0 depends only on the 
length of the interval (s, t], that is 5 = t — s, but not on s or t 
itself. For univariate functions x and y with x(t — s) = x(s, t) 
and y(t — s) = y(s, t) for all t > s > the operators * and o 
follow from (0} with 5 = t — s > as 



(x * y)(6) = ^2x(6- r)y(r) 

T = 

oo 

(x o y){5) = x(6 + r)y(r) 



r=0 

where we let s — > oo in case of the univariate o operator. 
Note that the univariate convolution operator * as opposed 
to its bivariate equivalent is commutative. We will use the o 
operator also for 5 < whereupon the sum is evaluated for 
all r E [-S, oo). 



The proof of Th. [5] which is shown in the sequel, uses 
the following lemma for composition of univariate * and o 
operators. 

Lemma 3 (Composition of univariate * and o) Let x{$), 
y(S), and z(6) be univariate, real-valued functions. It holds 
that 

(xo(y* z))(5) = {{x oy)o z)(5). 

Proof: From the definition of univariate operators * and 
o it follows that 

OO Tl 

(x o (y * z))(6) = Y X ( S + T i) X! y ( Tl ~ T 2) Z ( T 2) 



Tl=0 



T 2 =0 



Tl— U\ ,U2~>0'-Ui J r U2—Ti 

OO oo 

= ^ x(S + ui + u 2 )y(u 1 )z(u 2 ) 

U2—0 Ui— 

= ((xoy)oz)(S) 

which proves the composition rule. ■ 
The following theorem extends the results from [23], [1] to 
servers with cross-traffic in series and shows the linear growth 
of end-to-end performance bounds in the number of tandem 
servers n. For ease of presentation we investigate the case 
of homogeneous servers and flows. Solutions for the general, 
heterogeneous case follow in a similar way. 

Theorem 3 (End-to-End Backlog and Delay Bounds) 

Consider n work-conserving, constant rate servers in series, 
each with capacity C and independent (a c (9), p c (9)) cross- 
traffic under the general scheduling model. Consider an 
independent (o~(6), p(9)) aggregate of through flows that 
traverses all n servers in series. Any b and d are upper 
backlog respectively delay bounds with violation probability 
e e (0, 1] if for any 9 6 (0, oo) it holds that C > p(9) + p c (9) 
for stability and 

np(9)a c (9) nlB.7 — lne 



b > cj{9) + 
d> 



C- Pc (9) 9 
tj{9) na c (9) nki.7 — lne 



p{9) C - Pc (9) 



9p{9) 



i_ e -fp(«) 



l_ e -8(C-P(9)-/>c«>» 



and T 



where 7 

c-p c (e)- 

For n = 1 an improved condition under which d is a delay 
bound is 



d> 



a(9) + a c {9) ln 7 '-lne 



C - Pc {9) 9{C - Pc (9)) 



where 7' 



For n > 2 the conditions are 



l-e-eCC-pW-PcCS)) ■ 

a(9)+na c (9) n\nC-\m 



d> 



C- Pc {9) 9{C- Pc {9)) 



-8(C-p(9)-p c (0)) 

and d > ™ e -e ( c- PW - Pc{ e)) where ( 



(l+d/n) 1+d /" 
(d/n) d /™ • 



Clearly the conditions for backlog and delay bounds depend 
linearly on the number of servers in series n. The first two 
terms of the backlog bound can be interpreted as the burstiness 
of the through flows a(9) and the data accumulated due to 
the rate of the through flows p(9) during the latency induced 
by cross-traffic a c (9)/(C — p c (9)) at each of the n servers. 
Accordingly, the delay bound comprises the latency a(9) / (C~ 
p c {S)) which is due to the burst size of the through flows and 
the cumulated latency of the n servers induced by cross-traffic, 
that is n times a c (6)/(C - p c {9)). 

Proof: The end-to-end concatenation of n statistically 
independent dynamic servers is described by convolution of 
the respective MGFs, see Th. \l\ Using the univariate * and o 
operators with 5 = t — s and applying Lemma [3] the bounds 
from Th. |3 build on the following MGF 



M A0S (O,S) <(M A (6) o (M S i(0) * • • ■ * Ms» (*)))(*) 
= (M A (0) o M S i (6) o ••• o M s » (9))(S). 



(6) 



The MGF of (a (9), p(#))-constrained arrivals is upper 
bounded by M A (9, 6) < e ^W+p(0)S) for g > . With Prop.[T] 
and using the notation (x) + = max[0,x] the MGF of the 
leftover service of a work-conserving, constant rate server 
with capacity C and (a c (9), p c (9)) cross-traffic is bounded 
according to 

M S i(9,6) < e - ((c-P.(e))8-a a (e))+ = e -o(c- Pc (e))(S-T)+ 



<7c(8) 

With © it follows for (a(9),p(9)) through traffic that 



where T = c °Jp n ^ is a latency induced by the cross-traffic. 



oo oo oo 



ma s(m)<E---E E e 



U n —0 U2—0Ul—0 

e _ 9(C _ Pc( 0)) (Ml _ T) + e _0 ( C_p c (0))( U2 -T)_+ _ -6(C-p a {6)){u n -T) + 



and after some reordering we obtain 



-e({c-p a {e))(u l -T)+-p{e)(u l -T)) 



The sums J2^=o are divided into J^J^Iq 1 an ^ S^=rr] an< ^ 
solved for 9 > and p(9) > using geometric series which 
yields 

m-i 

e -e{{c- Pc {0)){u i -T)+-p{e){u i -T)) 



u i= 



-9p(8)T 



m-i 

E( 

Ui=0 



Jp(6)\Ui _ p -ep{9)T[ 



•pwm _ i 

s 0p(e) - i 



and under the stability condition C > p(9) + p c {9) 



E * 



-0{{C-p c {0)){u i -T)+-p{0){u i -T)) 



. e e(C-p(e)- Pc (e))T ( e -e(c- P (0)- P c(0))y 
u*=rri 

e -e(c-p(9)- Pc (e))(rTi-T) 



< 



1 



1 _ e -0(c-p(e)-p c (o)) - i _ e -d(c-p(e)- Pc (0)) 
such that 

M A0S (9,5) < e H°«»+P(e)(5+nT)) 

■ I _ e -e P (e)(T+i) 1 



I _ e -0p(0) l _ e -0(C-p(9)-p a (9)) 



(7) 



From Th. |2]it follows that b and d are a backlog respectively 
delay bound which are violated at most with probability e if 
for any 9 £ (0, oo) it holds that 



&>i(lnM A0 s(^O)-liie) 
h]xiM A0S (9,-d)-lae) <0. 



(8) 



Inserting into (|8j and reordering completes the first part 
of the proof. 

Note that the derivation of the delay bound evaluates larger 
intervals than stated in Th. |2] to simplify the derivation of 
a conservative, closed-form bound. Taking the corresponding 
restriction of the interval into account we have for n = 1 that 



9, 



0( tT (0)+p(0)( T -d)) e -0((C-p c (0))T-<Ta(0)) 



r=d 

e -0(C-p(0)-p e (0))d 



J(<j(8)+a c (0)-p(0)d)_ 



I _ e -0{C-p{0)-p c {0)) 



where 9 > and C > p(9) + p c (9) for stability. The delay 
bound follows immediately from (jSJl. 

For n > 2 we obtain from the first expression in using 
the commutativity of the univariate * operator that 



M 



A0Sl 



-d)) 



"3 «2 



E --EE' 

U n -i— U2=0u±=0 



-e((c-p c (e))ui-ffe(e)) 



e -0((c- Pc (.0))( U2 - ul )-<T C (.0)) _ _ _ e -e((c-p e (e))(r-« n _i)- - c (fl)) i 
After some reordering we find 



^fr + H ^ wm -l < 1 - e -'pW(r+i) 



1 _ e -9p(9) - 1 _ e -0p(fl) 



E E - 



-e(c-p(e)- Pc (e)) T 



With q = e -«(c~p(e)-p c (e)) the double sum becomes [39] 

1 N 



E 



E 

i>0:E? =1 -' 



9 



E 

T = d 



t + n 
n-1 



1 



(9) 



\ " 



P 



t + n — 1 
n - 1 



P Q 



Under the stability condition C > p{9) + p c (8) and for 9 > 
we choose p = 1 — q where p,q 6 (0, 1) and give (|9j a 
probabilistic interpretation: Consider a number of independent 
trials each having probability p of being a success. The 
probability to obtain the rt-th success exactly in the (r + ri)-th 
trial is given by the negative binomial distribution. It equals 
)p n Q T ■ Consequently, l|9} can be interpreted as l/p n 
times the probability that d + n or more trials are needed for 
n successes. 

The number of trials required to obtain n successes is, 
however, 53lLi where the Xi are independent geometric 
random variables that each describe the number of trials 
required to obtain only the i-th success [39]. That is X\ is 
the number of trials required to obtain the first success, X2 is 
the additional number of trials required to obtain the second 
success and so on. The probability that x — 1 trials fail before 
a success occurs at the x-th trial is P{Xi = x} = pq x " 1 for 
x > 1 and the corresponding MGF [40] becomes Mxi(^) = 
for qe° < 1. The MGF of X = £" =1 X, follows as 
(Mxi (#))" and with Chernoff's bound we estimate 



1 — qe 



— P\X > d + n} < — inf 

p n p n i?e[o,o< 

which is minimized at e 8 = — &hL. 
■d > 0. It follows that 



= -tf(d+n) 



(t 



pe 



qe L 



q{l+d/n) 



if d > rPSL to ensure 



l-q 



1 



-P{X>d + n}< 



q(l + d/n) 
d/n 

(l + d/n) 1+d / n 



(1 + d/n) r 



(d/n) d / n 

which provides an upper bound for l|9}. Assembling all parts 



we obtain 



M 



A0S( 



-d) < e e W)+™ ff <=W-(c-Pc(e))<2) 

■(l + d/n) 1+d l n \ n 



(d/n) d / n 

and the delay bound follows by insertion into (JSJl. ■ 

V. Numerical Results 

We provide numerical delay bounds for n servers with 
cross-traffic in series as shown in Fig. ^ The servers each 
have capacity C and the leftover service is determined under 
the general scheduling model. We use stationary, single leaky 
bucket traffic sources with maximum burst size b and rate r. 
For the MGF of such regulated sources, see for example [28], 
[16], it is known for > and t >s > that M A (9, s, t) < 
1 + b+r(t- s ) (e e ^ b+r ^- s ^ - 1). We compute stationary delay 
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Fig. 2. Utilization under aggregate scheduling 



bounds with violation probability e = 1CP 6 from Th. \l\ and 
Th. |2 and compare the results to respective bounds obtained 
from deterministic network calculus, that is for e = 0. Note 
that the probabilistic calculus recovers corresponding deter- 
ministic bounds for 9 — » 00, whereas the optimal choice of 9 
effectively exploits the statistical multiplexing gain. We show 
a significant statistical gain for aggregation of independent 
flows, for independent cross-traffic, and for concatenation of 
independent servers, where the linear scaling of performance 
bounds in the number of traversed servers n is confirmed. 

Fig. 12 shows results for a single server which, however, 
may be a lumped, equivalent system that represents an entire 
network. For ease of presentation homogeneous flows with 
maximum burst size 6 = 1 Mb and rate r = 30 Mb/s 
are used. Flows are added unless the target delay bound 
d < 10 ms is violated. Clearly with increasing number of 
independent flows the probabilistic calculus can efficiently 
exploit statistical independence, such that the utilization can 
be significantly enhanced. For the investigated scenario the 
average utilization of a 2.4 Gb/s link can be increased from 
0.29 to 0.71 by allowing for a small violation probability 
e = 10~ 6 of the target delay bound d = 10 ms. 

Fig. [3] shows the impact of high priority cross-traffic at 
a priority scheduler. A constant rate server with capacity 
C = 2.4 Gb/s is analyzed and delay bounds are derived for 
an aggregate of 20 independent, low priority flows, each with 
leaky bucket parameters 6=1 Mb and r = 30 Mb/s. At 
the same time rn independent, high priority flows with leaky 
bucket parameters b = 20/m Mb and r — 600/m Mb/s are 
scheduled. Since the sum of the traffic parameters of all m 
high priority flows is constant, the deterministic delay bound 
does not depend on m. It is 22.3 ms, whereas the probabilistic 
bound with violation probability e = 10~ 6 decreases from 
15.3 ms for m = 1 and approaches 4.4 ms for large m. 

Two effects which are due to statistical multiplexing can 
be itemized here: For m — 1 the difference between the 
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Fig. 3. Impact of high priority cross-traffic 



Fig. 5. End-to-end concatenation of servers in series 



deterministic and the probabilistic delay bound is almost 
entirely due to the statistical multiplexing gain among the 
20 low priority flows. For an increasing number of high 
priority flows an additional, important statistical multiplexing 
gain among the m high priority flows and between the two 
priority classes can be made accessible. For comparison the 
dotted lines at 2.9 ms and 4.4 ms show the probabilistic delay 
bound without cross-traffic and with constant bit rate cross- 
traffic respectively. Clearly as the number of independent high 
priority flows increases the aggregate of the high priority flows 
becomes smoother and for large m the probabilistic delay 
bound approaches the bound obtained under constant bit rate 
cross-traffic. 

The statistical multiplexing gain becomes more and more 
apparent, if the number of flows is increased. To this end 
Fig. |4] shows results for the same scenario used to obtain 
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Fig. 4. Impact of the network load 



Fig. |5] however, applying through and cross flows with burst 
size b = 0.1 Mb and rate r = 3 Mb/s. The number of 
through and cross flows are equal and are denoted by m. 
For the investigated 2.4 Gb/s link m = 400 corresponds 
to 400 through flows and 400 cross flows and results in a 
sustained network load of one. Fig |4] compares deterministic 
and probabilistic delay bounds for varying m respectively 
varying network load. Clearly, if the utilization is very low 
there are few flows and only a small statistical multiplexing 
gain can be realized. Thus, the deterministic and probabilistic 
delay bounds are almost equal. On the other hand, if the 
utilization is very high, large delays are seen more frequently 
and the probabilistic delay bounds grow rapidly. Eventually, 
for a load of one the probabilistic calculus does not provide 
a finite delay bound, whereas the deterministic calculus still 
does. Between these extreme points the probabilistic calculus 
effectuates a significant statistical multiplexing gain resulting 
in a considerable reduction of delay bounds compared to the 
deterministic case. 

Fig. 13 shows results for n servers with cross-traffic in 
series. At each server the current cross-traffic is de-multiplexed 
and fresh, independent cross-traffic is multiplexed. The same 
parameters as used for Fig. [3] are applied and m = 20 is 
chosen. The results confirm the scalability of the approach. 
Both the deterministic and the probabilistic bounds scale in 
0(n), where n is the number of servers in series. Thus, 
the important end-to-end scalability of deterministic network 
calculus has been achieved using a probabilistic calculus which 
effectuates a noticeable statistical multiplexing gain. 

Further on, the so-called Pay Bursts Only Once phe- 
nomenon [2] known from deterministic network calculus can 
be observed in a probabilistic setting. In brief this phenomenon 
describes the observation that the increment of the end-to-end 
delay bound for each additional, concatenated server is smaller 
than the delay bound that would be obtained for the server in 
isolation. 



VI. Conclusions 

We presented an end-to-end probabilistic network calculus 
with moment generating functions which efficiently utilizes 
statistical independence while preserving the intuitive convo- 
lution formulas and the fundamental concatenation theorem of 
deterministic network calculus. The derived framework meets 
the benchmark set by the deterministic calculus and achieves 
a linear scaling of end-to-end, per-flow backlog and delay 
bounds in the number of traversed servers. 

We showed how moment generating functions can effi- 
ciently be used in the context of network calculus to ex- 
ploit independence beyond statistical multiplexing of flows. A 
relevant gain is obtained for different scheduling disciplines, 
such as priority or first-in first-out scheduling, and for the 
concatenation of servers. We presented numerical results for 
tandem servers that support the efficiency of the approach. 
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